Zero-Inflated Poisson and Zero-Inflated Negative Binomial Models Using the COUNTREG Procedure
نویسندگان
چکیده
Real-life count data are frequently characterized by overdispersion and excess zeros. Zero-inflated count models provide a parsimonious yet powerful way to model this type of situation. Such models assume that the data are a mixture of two separate data generation processes: one generates only zeros, and the other is either a Poisson or a negative binomial data-generating process. The result of a Bernoulli trial is used to determine which of the two processes generates an observation. OVERVIEW The COUNTREG (count regression) procedure analyzes regression models in which the dependent variable takes nonnegative integer or count values. The dependent variable is usually the number of times an event occurs. Some examples of event counts are: number of claims per year on a particular car owner’s auto insurance policy number of workdays missed due to sickness of a dependent in a 4-week period number of papers published per year by a researcher In count regression, the conditional mean E.yi jxi / of the dependent variable, yi , is assumed to be a function of a vector of covariates, xi . Possible covariates for the auto insurance example are: age of the driver type of car daily commuting distance MARGINAL EFFECTS IN COUNT REGRESSION Marginal effects provide a way to measure the effect of each covariate on the dependent variable. The marginal effect of one covariate is the expected instantaneous rate of change in the dependent variable as a function of the change in that covariate, while keeping all other covariates constant. Unlike in linear models, the derivative of the conditional expectation with respect to xi;j is no longer equal to ˇj—that is, @E.yi jxi /=@xi;j ¤ ˇj . For example, for the Poisson regression with E.yi jxi / D ex 0 i ˇ is @E.yi jxi / @xi;j D ˇj e x i ˇ D ˇjE.yi jxi / (1) Therefore the marginal effect of the change in covariate xi;j depends not only on ˇj , but also on all other estimated coefficients, and on all other covariate values. Another interpretation is that a one-unit change in the j th covariate leads to a proportional change in the conditional mean E.yi jxi / of ˇj . BASIC MODELS: POISSON AND NEGATIVE BINOMIAL REGRESSION MODELS The Poisson (log-linear) regression model is the most basic model that explicitly takes into account the nonnegative integer-valued aspect of the dependent count variable. In this model, the probability of an event count yi , given the vector of covariates xi , is given by the Poisson distribution: P.Yi D yi jxi / D e i yi i yi Š ; yi D 0; 1; 2; : : :
منابع مشابه
Hurdle, Inflated Poisson and Inflated Negative Binomial Regression Models for Analysis of Count Data with Extra Zeros
In this paper, we propose Hurdle regression models for analysing count responses with extra zeros. A method of estimating maximum likelihood is used to estimate model parameters. The application of the proposed model is presented in insurance dataset. In this example, there are many numbers of claims equal to zero is considered that clarify the application of the model with a zero-inflat...
متن کاملZero inflated Poisson and negative binomial regression models: application in education
Background: The number of failed courses and semesters in students are indicatorsof their performance. These amounts have zero inflated (ZI) distributions. Using ZI Poisson and negative binomial distributions we can model these count data to find the associated factors and estimate the parameters. This study aims at to investigate the important factors related to the educational performance of ...
متن کاملAssessment of length of stay in a general surgical unit using a zero-inflated generalized Poisson regression
Background: The effective use of limited health care resources is of prime importance. Assessing the length of stay (LOS) is especially important in organizing hospital services and health system. This study was conducted to identify predictors of LOS among patients who were admitted to a general surgical unit. Methods: In this cross-sectional study, the sample included all patien...
متن کاملمقایسه مدل شبکه عصبی مصنوعی با مدلهای رگرسیونی دادههای شمارشی در پیش بینی تعداد دفعات اهدای خون
Background: Modeling is one of the most important ways for explanation of relationship between dependent and independent response. Since data, related to number of blood donations are discrete, to explain them it is better to use discrete variable distribution like Poison or Negative binomial. This research tries to analyze numerical methods by using neural network approach and compare ...
متن کاملZero-inflated negative binomial modeling, efficiency for analysis of length of maternity hospitalization
Background: Mothers’ delivery is one of the most common hospitalization factors throughout the world and it’s modeling can explain distribution and effective factors on rising and decreasing of it. The objective of the present study was a suitable modeling for mother hospitalization time and comparing it with different models. Materials & Methods: Present study is an observational and cross-s...
متن کاملModeling the Number of Attacks in Multiple Sclerosis Patients Using Zero-Inflated Negative Binomial Model
Background and aims: Multiple sclerosis (MS) is an inflammatory disease of the central nervous system.The impact of the number of attacks on the disease is undeniable. The aim of this study was to analyze thenumber of attacks in these patients.Methods: In this descriptive-analytical study, the registered data of 1840 MS patients referred to the MS clinicof Ayatollah Kash...
متن کامل